skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Wang, S"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Accurate simulation of earthquake scenarios is essential for advancing seismic hazard analysis and risk mitigation strategies. At the San Diego Supercomputer Center (SDSC), our research focuses on optimizing the performance and reliability of large-scale earthquake simulations using the AWP-ODC software. By implementing GPU-aware MPI calls, we enable direct data processing within GPU memory, eliminating the need for explicit data transfers between CPU and GPU. This GPU-aware MPI achieves nearly ideal parallel efficiency at full scale across both Nvidia and AMD GPUs, leveraging the MVAPICH-PLUS support on Frontier at Oak Ridge National Laboratory and Vista at the Texas Advanced Computing Center. We utilized the MVAPICH-Plus 4.0 compiler to enable ZFP compression, which significantly enhances inter-node communication efficiency – a critical improvement given the communication bottleneck inherent in large-scale simulations. Our GPU-aware AWP-ODC versions include linear forward, topography and nonlinear Iwan-type solvers with discontinuous mesh support. On the Frontier system with MVAPICH 4.0, Hip-aware MPI calls on MI250X GPUs deliver nearly ideal weak-scaling speedup up to 8,192 nodes for both linear and topography versions. On TACC’s Vista system, CUDA-aware MPI calls on GH200 GPUs substantially outperform their non-GPU-aware counterparts across all three solver versions. This poster will present a detailed evaluation of GPU-aware AWP-ODC using MVAPICH, including the impact of ZFP message compression compared to the native versions. Our results highlight the importance of Mvapich support for GPU-ware MPI and on-the-fly compression techniques for accelerating and scaling earthquake simulations. 
    more » « less
    Free, publicly-accessible full text available August 20, 2026
  2. We have implemented GPU-aware support across all AWP-ODC versions and enhanced message-passing collective communications for this memory-bound finite-difference solver. This provides cutting-edge communication support for production simulations on leadership-class computing facilities, including OLCF Frontier and TACC Vista. We achieved significant performance gains, reaching 37 sustained Petaflop/s and reducing time-to-solution by 17.2% using the GPU-aware feature on 8,192 Frontier nodes, or 65,336 MI250X GCDs. The AWP-ODC code has also been optimized for TACC Vista, an Arm-based NVIDIA GH200 Grace Hopper Superchip, demonstrating excellent application performance. This poster will showcase studies and GPU performance characteristics. We will discuss our verification of GPU-aware development and the use of high-performance MVAPICH libraries, including on-the-fly compression, on modern GPU clusters. 
    more » « less
    Free, publicly-accessible full text available September 10, 2026
  3. To efficiently predict flooding caused by intense rainfall (pluvial flooding), many physics-based flood inundation models adopt simplistic parameterizations of infiltration such as the Kostiakov, Horton, Soil Conservation Service and Green-Ampt methods. However, these methods are not explicitly dependent on soil moisture (or the groundwater table height), which is known to strongly influence the amount of runoff generated by rainfall. Models that fully couple surface and groundwater flow equations offer an alternative approach, but require larger amounts of input data and greater computational effort. Here we present a fast flood inundation model that couples two-dimensional shallow-water equations for surface flow with a zero-dimensional, time-dependent groundwater equation to capture sensitivity to groundwater. The model is also configured to account for storm drains, pumping and gates so human influences on flooding can be resolved, and is implemented with a dual-grid finite-volume scheme and with OpenACC directives for execution on graphical processing units (GPUs). With a 1.5 m resolution application across a 1,000 km area in Miami, Florida, where pluvial flooding is sensitive to depth to groundwater and simulation models that accurately reproduce observed flooding are needed to explore and plan response options, we first show that hourly water levels are predicted with a Mean Absolute Error of 8–16 cm across six canal gaging stations where flows are affected by tides, pumping, gate operations, and rainfall runoff. Second, we show high sensitivity of flooding to antecedent groundwater levels: flood extent is predicted to vary by a factor of six when initial depth to groundwater is varied between 10 and 200 cm, an amount that aligns with seasonal changes across the area. And third, we show that the model runs 30 times faster than real time (i.e., model speed = 30) using an NVIDIA V100 GPU. Furthermore, using a 3 m resolution model of Houston, Texas, we benchmark model speeds greater than 20 and 100 for domain sizes of 10,000 or 1,000 km2, respectively. The importance of model speed is discussed in the context of flood risk management and adaptation. 
    more » « less
    Free, publicly-accessible full text available July 1, 2026
  4. Free, publicly-accessible full text available May 19, 2026
  5. Free, publicly-accessible full text available April 21, 2026
  6. Free, publicly-accessible full text available May 3, 2026
  7. Laskin, J; Ouyang, Z (Ed.)
    Chirality effects on the intrinsic gas-phase acidity of oligopeptides have been studied using a pair of stereoisomeric tripeptides consisting of a D/L-cysteine (C) and two residues of alanine (A): CAA and dCAA, where the C-terminus is amidated. Mass spectrometry measurements through bracketing via collision-induced dissociation clearly show that CAA is a stronger gas-phase acid than dCAA. Quantitative values of the acidity were determined using the extended Cooks kinetic method. The resulting deprotonation enthalpy (∆acidH) for CAA is 326.2 kcal/mol (1364.7 kJ/mol) and for dCAA it is 326.8 kcal/mol (1367.6 kJ/mol). The corresponding gas-phase acidity (∆acidG) for CAA is 321.3 kcal/mol (1344.2 kJ/mol) and for dCAA it is 322.0 kcal/mol (1347.3 kJ/mol). Changing the N-terminal cysteine from the L-form to the D-form reduces the gas-phase acidity by about 0.6 kcal/mol (2.5 kJ/mol). Extensive conformational searches followed by quantum chemical calculations at the ωB97X-D/6-311+G(d,p) level of theory yielded a set of lowest energy conformations for each peptide species. Theoretical gas-phase acidities calculated using the Boltzmann averaged conformational contributions are in good agreement with the experimental data. The shift in the acidity is likely due to the conformational effect induced by D-cysteine, which increases the stability of the neutral dCAA, and hence reduces its acidity. A chirality change on a single amino acid can have a noticeable effect on the biochemical properties of peptides and proteins. 
    more » « less
    Free, publicly-accessible full text available May 20, 2026
  8. Free, publicly-accessible full text available May 1, 2026
  9. Free, publicly-accessible full text available April 25, 2026
  10. Artificial Intelligence (AI) has demonstrated significant potential in healthcare, particularly in disease diagnosis and treatment planning. Recent progress in Medical Large Vision-Language Models (Med-LVLMs) has opened up new possibilities for interactive diagnostic tools. However, these models often suffer from factual hallucination, which can lead to incorrect diagnoses. Fine-tuning and retrieval-augmented generation (RAG) have emerged as methods to address these issues. However, the amount of high-quality data and distribution shifts between training data and deployment data limit the application of fine-tuning methods. Although RAG is lightweight and effective, existing RAG-based approaches are not sufficiently general to different medical domains and can potentially cause misalignment issues, both between modalities and between the model and the ground truth. In this paper, we propose a versatile multimodal RAG system, MMed-RAG, designed to enhance the factuality of Med-LVLMs. Our approach introduces a domain-aware retrieval mechanism, an adaptive retrieved contexts selection, and a provable RAG-based preference fine-tuning strategy. These innovations make the RAG process sufficiently general and reliable, significantly improving alignment when introducing retrieved contexts. Experimental results across five medical datasets (involving radiology, ophthalmology, pathology) on medical VQA and report generation demonstrate that MMed-RAG can achieve an average improvement of 43.8% in factual accuracy in the factual accuracy of Med-LVLMs. 
    more » « less
    Free, publicly-accessible full text available April 24, 2026